GMATo: A novel tool for the identification and analysis of microsatellites in large genomes

Authors

  • Xuewen Wang
  • Peng Lu
  • Zhaopeng Luo
Abstract

Simple sequence repeats (SSRs), also called microsatellites, are very useful for genetic marker development and genome applications. The growing number of fully sequenced large genomes provides a source for in silico SSR mining. However, existing SSR mining tools cannot process large genomes efficiently and generate poor statistics or none at all. Genome-wide Microsatellite Analyzing Tool (GMATo) is a novel tool for SSR mining and statistics at the genome level. It is faster and more accurate than the existing tools SSR Locator and MISA. If a DNA sequence is too long, it is split into short segments of several Mb each; motifs are then generated and searched for using Perl's pattern-matching functions. Matched loci from each chunk are then merged to produce the final SSR loci information. Only one input file, containing raw FASTA DNA sequences, is required; the output files, in tabular format, list all SSR loci information and the statistical distribution under four classifications. GMATo was programmed in Java and Perl with both graphical and command-line interfaces, each of which can be run on its own in a platform-independent manner with full parameter control. GMATo is a powerful tool for complete SSR characterization in genomes of any size. AVAILABILITY: GMATo is freely available at http://sourceforge.net/projects/gmato/files/?source=navbar or on request.
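
The chunk, search, and merge workflow described in the abstract can be illustrated with a minimal Python sketch. This is not GMATo's code (the tool itself is written in Java and Perl). The function names `generate_motifs` and `find_ssrs`, the parameters `max_len`, `min_repeats` and `chunk_size`, and in particular the way repeats that touch a chunk boundary are rescanned are all assumptions made for illustration; the abstract does not say how GMATo handles boundaries.

```python
import re
from itertools import product


def generate_motifs(max_len=3):
    """List primitive motifs of length 1..max_len, shortest first.

    Motifs that are repeats of a shorter unit (e.g. "AA", "ATAT") are skipped.
    """
    motifs = []
    for k in range(1, max_len + 1):
        for letters in product("ACGT", repeat=k):
            motif = "".join(letters)
            if any(k % d == 0 and motif == motif[:d] * (k // d) for d in range(1, k)):
                continue
            motifs.append(motif)
    return motifs


def find_ssrs(seq, max_len=3, min_repeats=5, chunk_size=1_000_000):
    """Return (start, end, motif, repeats) tuples, 0-based and end-exclusive."""
    seq = seq.upper()
    n = len(seq)
    # An SSR that straddles a chunk end without being detected in that chunk
    # must start within the last `tail` bases of the chunk.
    tail = min_repeats * max_len - 1
    if chunk_size <= tail:
        raise ValueError("chunk_size must be at least min_repeats * max_len")
    # One pattern for all motifs: a motif followed by at least
    # min_repeats - 1 further copies of itself.
    pattern = re.compile(
        "(%s)\\1{%d,}" % ("|".join(generate_motifs(max_len)), min_repeats - 1)
    )
    loci = []
    pos = 0
    while pos < n:
        end = min(pos + chunk_size, n)
        next_pos = end - tail if end < n else n
        for m in pattern.finditer(seq[pos:end]):
            motif = m.group(1)
            start, stop = pos + m.start(), pos + m.end()
            if end < n and end - stop < len(motif):
                # The repeat may continue into the next chunk (assumed
                # boundary rule): rescan it from its own start.
                if start == pos:
                    raise ValueError("chunk_size is smaller than an SSR locus")
                next_pos = start
                break
            loci.append((start, stop, motif, (stop - start) // len(motif)))
            next_pos = max(next_pos, stop)
        pos = next_pos
    return loci
```

Calling `find_ssrs(sequence)` on a sequence string returns one tuple per locus, and the list built across chunks plays the role of the merged loci table. Rescanning a boundary-touching repeat from its own start is one simple way to make the chunked result agree with a single-pass scan. A chunk size far larger than any expected SSR, such as the several Mb mentioned in the abstract, keeps that rescan cheap.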


Related resources

Utilization of a 17 Microsatellites Set For Bovine Traceability in Czech Cattle Populations

For identification of individuals and parentage control performed by cattle breeders in the Czech Republic, a novel Finnish Bovine Genotypes™ Panel 3.1 was amplified by means of one multiplex polymerase chain reaction. The Bovine Panel encompasses all 12 STR loci recommended by the International Society for Animal Genetics (ISAG) for routine use in parentage testing and identification, including...


Identification of functional short non-coding RNAs in sheep and goat using bioinformatics methods

MicroRNAs (miRNAs) are small non-coding RNAs that have functional roles in post-transcriptional modification. They regulate gene expression by an RNA interfering pathway through cleavage or inhibition of the translation of target mRNA. Numerous miRNAs have been described for their important functions in developmental processes in numerous animals, but there is limited information about sheep an...


Identification of microRNAs in corpus luteum of pregnancy in buffalo (Bubalus bubalis) by deep sequencing

This study aimed to identify miRNAs of the corpus luteum (CL) in buffaloes during pregnancy. For this study, CL (n=2) were collected from gravid uteri of buffalo and RNA was isolated. Following this, the purity and integrity of the RNA were checked and it was used for deep sequencing on the Illumina HiSeq 2500 platform. The reads’ quality was checked prior to in silico analyses viz. identification of conser...


Use of novel proteins in the development of a Staphylococcus aureus vaccine

Background: Staphylococcus aureus and Staphylococcus epidermidis are major human pathogens of increasing importance due to the spread of antibiotic resistance. Novel potential targets for therapeutic antibodies are products of staphylococcal genes expressed during human infection. Previously, the secreted and surface-exposed proteins among seroreactive antigens have been discovered. Furthermore...


Analysis of c.3369+213TA[7-56] and D7S523 microsatellites linked to Cystic Fibrosis Transmembrane Regulator.

Cystic fibrosis (CF) is a life-limiting autosomal recessive disorder principally affecting the respiratory and digestive systems. It is caused by mutation of the cystic fibrosis transmembrane conductance regulator (CFTR) gene. The aim of this study was to determine the extent of repeat numbers and the degree of heterozygosity for c.3499+200TA(7_56) and D7S523 located in intron 17b and 1 cM proximal to t...



Journal title:

Volume 9, Issue

Pages -

Publication date: 2013